A local fingerprinting approach for audio copy detection
Abstract
This study proposes an audio copy detection system that is robust to various attacks, including the severe pitch shift and tempo change attacks that existing systems fail to detect. First, we propose a novel two-dimensional representation for audio signals called the time-chroma image. This image is based on a modification of the concept of chroma from the music literature and is shown to achieve better performance in song identification. Then, we propose a novel fingerprinting algorithm that extracts local fingerprints from the time-chroma image. The proposed local fingerprinting algorithm is invariant to time/frequency scale changes in audio signals and outperforms existing methods such as SIFT by a wide margin. Finally, we introduce a song identification algorithm that uses the proposed fingerprints. The resulting copy detection system is shown to significantly outperform existing methods. Besides detecting whether a song (or a part of it) has been copied, the proposed system can accurately estimate the amount of pitch shift and/or tempo change that may have been applied to the song.
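To make the idea of a time-versus-chroma representation concrete, the following is a minimal sketch of conventional chroma folding. The abstract does not say how the paper modifies chroma, so this is not the authors' time-chroma image or their fingerprinting algorithm. The function names `chroma_image` and `estimate_semitone_shift`, and every parameter value, are assumptions made for illustration. The sketch relies on one standard property: in a 12-bin chroma representation, a pitch shift by a whole number of semitones appears as a circular shift along the chroma axis.

```python
import numpy as np


def chroma_image(signal, sample_rate, frame_len=4096, hop=2048,
                 n_bins=12, f_ref=440.0, f_min=55.0, f_max=3520.0):
    """Fold short-time spectra into pitch classes.

    Returns an array of shape (n_bins, n_frames): rows are chroma bins
    (bin 0 is the pitch class of f_ref), columns are time frames.
    All defaults are illustrative assumptions, not values from the paper.
    """
    window = np.hanning(frame_len)
    n_frames = 1 + (len(signal) - frame_len) // hop
    freqs = np.fft.rfftfreq(frame_len, d=1.0 / sample_rate)
    valid = (freqs >= f_min) & (freqs <= f_max)

    # Map each retained FFT bin to a pitch class (semitones from f_ref, mod 12).
    semitones = n_bins * np.log2(freqs[valid] / f_ref)
    classes = np.mod(np.round(semitones).astype(int), n_bins)

    image = np.zeros((n_bins, n_frames))
    for t in range(n_frames):
        frame = signal[t * hop: t * hop + frame_len] * window
        magnitude = np.abs(np.fft.rfft(frame))[valid]
        # Sum the magnitudes of all FFT bins that share a pitch class.
        image[:, t] = np.bincount(classes, weights=magnitude, minlength=n_bins)
    return image


def estimate_semitone_shift(image_a, image_b):
    """Estimate the circular chroma shift (in bins) that maps image_a to image_b.

    Compares time-averaged chroma profiles, so it ignores tempo entirely.
    """
    profile_a = image_a.mean(axis=1)
    profile_b = image_b.mean(axis=1)
    scores = [np.dot(np.roll(profile_a, k), profile_b)
              for k in range(len(profile_a))]
    return int(np.argmax(scores))
```

For example, calling `chroma_image` on a 440 Hz sine and on the same sine raised by three semitones, then passing the two images to `estimate_semitone_shift`, should report a shift of three bins. A tempo change would instead stretch or compress the image along the time axis, which this averaged comparison does not measure.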
Similar resources
Robust features for content-based audio copy detection
In this paper, we present the latest improvements on a spectrogram-matrix-based fingerprinting system for detecting transformed audio copies. In particular, we experiment with two feature parameters derived using global and local spectrogram averages and show that combining results from these two feature parameters significantly improves performance. We test our system on TRECVID 2010 contentbase...
ITU MSPR TRECVID 2010 Video Copy Detection System
In this paper we describe the system designed by the ITU MSPR Group for content-based video fingerprinting as applied to the TRECVID 2010 Content Based Copy Detection (CBCD) benchmark. This year, the focus of the system was on integrating audio and video fingerprinting to improve robustness to attacks. The proposed system consists of three main modules: audio/video fingerprint extraction, aud...
TNO at TRECVID2008 Combining Audio and Video Fingerprinting for Robust Copy Detection
TNO has evaluated a baseline audio and a video fingerprinting system based on robust hashing for the TRECVID 2008 copy detection task. We participated in the audio, the video and the combined audio-video copy detection task. The audio fingerprinting implementation clearly outperformed the video fingerprinting implementation. We combined the audio fingerprinting results with the video fingerprin...
Crim’s Content-based Copy Detection System for Trecvid
The approach we tested in our submitted runs: for visual-based copy detection, we find links between video shot key-frames using a probabilistic latent space model over local matches between the key-frame images. This facilitates the extraction of significant groups of local matching descriptors that may represent common semantic elements of near-duplicate key-frames. For 2009, we have worked on...
National Institute of Informatics, Japan at TRECVID 2010
This paper reports our experiments for three TRECVID 2010 tasks: instance search, semantic indexing, and content-based copy detection. For the instance search task, we present a simple approach that uses face-specific features for PERSON and CHARACTER queries and a combination of local and global features for the OBJECT and LOCATION queries. For the semantic indexing task, we report two approac...
Journal title:
- Signal Processing
Volume 98
Publication date: 2014